A computationally efficient speech/music discriminator for radio recordings

Authors

  • Aggelos Pikrakis
  • Theodoros Giannakopoulos
  • Sergios Theodoridis
Abstract

This paper presents a speech/music discriminator for radio recordings, based on a new and computationally efficient region-growing technique that has its origins in the field of image segmentation. The proposed scheme operates on a single feature, a variant of the spectral entropy, which is extracted from the audio recording by means of short-term processing. The method has been tested on recordings from radio stations broadcasting over the Internet and, despite its simplicity, yields performance comparable to that of more sophisticated approaches.
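To make the two ingredients named in the abstract more concrete, the following is a minimal Python sketch, not the authors' implementation. The abstract does not specify which variant of the spectral entropy is used or how regions are grown, so the code assumes plain Shannon entropy of the normalised power spectrum and a generic one-dimensional region-growing rule. All function names, the frame length, the hop size, and the tolerance are hypothetical choices made for illustration.

```python
import cmath
import math
import random


def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2). Fine for a sketch, slow for real audio."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(frame, eps=1e-12):
    """Shannon entropy (in bits) of the normalised power spectrum of one frame.

    Assumption: the paper uses a *variant* of this feature that the abstract
    does not describe; this is the textbook definition.
    """
    power = power_spectrum(frame)
    total = sum(power) + eps
    probs = [p / total for p in power]
    return -sum(p * math.log2(p) for p in probs if p > 0)


def short_term_entropy(signal, frame_len=64, hop=32):
    """Short-term processing: one entropy value per overlapping frame."""
    last_start = len(signal) - frame_len
    return [spectral_entropy(signal[i:i + frame_len]) for i in range(0, last_start + 1, hop)]


def grow_region(features, seed, tol):
    """Grow an interval [lo, hi] around `seed` in a 1-D feature sequence.

    A neighbouring frame is absorbed while its value stays within `tol` of the
    running mean of the region (a hypothetical homogeneity criterion).
    """
    lo = hi = seed
    total = features[seed]
    grown = True
    while grown:
        grown = False
        mean = total / (hi - lo + 1)
        if lo > 0 and abs(features[lo - 1] - mean) <= tol:
            lo -= 1
            total += features[lo]
            grown = True
        mean = total / (hi - lo + 1)
        if hi < len(features) - 1 and abs(features[hi + 1] - mean) <= tol:
            hi += 1
            total += features[hi]
            grown = True
    return lo, hi


def demo_signal(seed=0):
    """Synthetic test input: 512 samples of a pure tone followed by 512 samples of noise."""
    rng = random.Random(seed)
    tone = [math.sin(2 * math.pi * 8 * t / 64) for t in range(512)]
    noise = [rng.uniform(-1.0, 1.0) for _ in range(512)]
    return tone + noise
```

On the synthetic signal, frames of the pure tone have an entropy close to zero, whereas noise frames have a much higher entropy, so a region seeded in the first frame stops growing near the tone/noise boundary. Real speech and music are far less cleanly separated, and the synthetic tone and noise only stand in for "low-entropy" and "high-entropy" material.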

Similar resources

Speech-Music Discrimination from MPEG-1 Bitstream

This paper describes a proposed algorithm for speech/music discrimination that works on data taken directly from an MPEG-encoded bitstream, thus avoiding the computationally difficult decoding-encoding process. The method is based on thresholding of features derived from the modulation envelope of the frequency-limited audio signal. The discriminator is tested on more than 2 hours of audio data, ...

Towards a Software-radio Enabled Broadcast Media Navigator

New consumer services are transforming the Hertzian broadcast media into a rich source of multimedia content. A Broadcast Media Navigator combining Software Radio elements with Audio Indexing techniques and digital audio recording is proposed. Results from a PC-based demonstrator implementing a simple speech/music discriminator are presented.

A Speech/music Discriminator-based Audio Browser with a Degree of Certainty Measure

In recent years, the field of content-based audio signal classification and retrieval has attracted growing interest among researchers around the world. This paper describes a technique that automatically discriminates between speech and music in audio signals. Our goal was to achieve reliable classification results using computationally inexpensive time-domain features. The cl...

A new approach for audio classification and segmentation using Gabor wavelets and Fisher linear discriminator

Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified ...

Unsupervised Feature Learning for Speech and Music Detection in Radio Broadcasts

Detecting speech and music is an elementary step in extracting information from radio broadcasts. Existing solutions either rely on general-purpose audio features, or build on features specifically engineered for the task. Interpreting spectrograms as images, we can apply unsupervised feature learning methods from computer vision instead. In this work, we show that features learned by a mean-co...

Publication date: 2006